This archive contains replication data for a bibliometric study, titled "Diversity, Inequality, and Opportunity Structure in Political Science: What 50 Years of Bibliometric Data Tells Us", authored by Yuner ZHU & Edmund Wai CHENG.

This study examines the changing patterns of knowledge production and diffusion in political science over the past five decades using a dataset of over 200,000 SSCIindexed research articles from 1970 to 2020. We analyze how author identity and four types of team diversity (namely, gender, ethnic, regional, and reference diversity) influence research outputs and outcomes. 

As per journal requirements, we have attached the original data used in this paper (regression_table_anonymous_final.csv, journal_scatter.csv, and timelines.csv) as well as the log files from our analysis using Python (Regression.ipynb and Visualizations.ipynb).
All analyses were conducted in Python with the support of several computational packages, including pandas, statsmodels, and sklearn.
The log files are produced by Jupyter Notebook, which display the numbers and coefficients used in all tables and figures.  